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DETAILED ACTION 
Response to Amendment 

1 . In response to the Office Action mailed February 3, 2004, applicant submitted an 
Amendment filed on March 18, 2004, in which the applicants have amended 
independent claims 1,19 and 27 to include limitations of (1 ) converting each of at least 
two voice commands to text and (2) determining text versions of the at least two voice 
commands by comparing words in the text with phrase grammar rules. 

Response to Arguments 

2. Applicant argues, with regards to now-canceled claim 23, that Logan et al. does 
not teach converting each of at least two voice commands to text and determining text 
versions of the at least two voice commands by comparing words in the text with 
phrase grammar rules. Applicant also argues, with regards to claims 15, 24 and 28, 
that Logan does not disclose or imply phrase grammar rules or composing voice 
commands from words in text (converted from speech), where each voice command 
corresponds to a phrase grammar rule. Applicant's arguments have been considered 
but are moot in view of the new ground(s) of rejection. 
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Claim Rejections - 35 USC § 103 



3. The following is a quotation of 35 U.S.C. 103(a) which fornns the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

4. Claims 1-8, 10-22 and 24-28 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Logan et al. (U.S. Patent No. 5,721,827), hereinafter referenced as 
Logan in view of Ladd et al. (U.S. Patent No, 6,539,359), hereinafter referenced as 
Ladd. 



Regarding claims 1 and 27, Logan discloses a method and an article of 
manufacture, performed on a computer system (column 1 , lines 48-52), for tracking 
time (log time; column 1 1 , lines 41 -51 ) using speech recognition (column 35, line 24), 
the method comprising the steps of: 

accessing speech data (audio speech file; column 2, line 57); 

recognizing at least two voice commands ("Go", "Five", "News", etc.) from the 
speech data, each voice command occurring at a different time (shift to different 
segment; column 12, lines 55-60); 

determining a first time (log file) associated with a first of the voice commands 
(first command "Go", column 12, lines 55-67); and 
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determining a second time (records start of new segment) associated with a 
second of the voice commands ("FIVE", "NEWS", ETC.; column 12, lines 55-67), but 
lacks converting each of at least two voice commands to text and determining text 
versions of the at least two voice commands by comparing words in the text with phrase 
grammar rules. 

Ladd discloses converting each of at least two voice commands (column 4, lines 
9-10 and column 5, lines 9-10) to text (column 4, lines 47-49); and 

determining text versions of the at least two voice commands (column 4, lines 
47-49) by comparing words in the text (VRU server compares input) with phrase 
grammar rules (phonetic rules; column 14, lines 25-45), to control interactive voice 
service. 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to modify Logan's invention such that it converts each of at 
least two voice commands to text and determines text versions of the at least two voice 
commands by comparing words in the text with phrase grammar rules, to provide 
pronunciation of words and phrases, and place markers in the text, to control interactive 
voice service (column 16, lines 17-20). 

Regarding claim 2, Logan discloses the method wherein the second voice 
command (BACK command subdivided into two commands) is implied because a 
predetermined time from the first voice command (predetermined time of segment) 
elapses before another voice command occurs (column 14, lines 1 1-27) and wherein 
the step of determining a second time comprises the step of assigning the second time 
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as the predetermined time plus the first time, if the first voice command elapses before 

another voice command occurs (column 11, lines 44-51). 

Regarding claims 3 and 17, Logan discloses the method wherein: 

the speech data (comment) comprises a time stamp (column 42, lines 20-30); 

the step of determining a first time comprises: 

determining an offset time (start and ending offset) between the time stamp and 
a time when the first voice command (audio presentation) is active (column 4, lines 48- 
57); and 

determining (identify) the first time through reference to the time stamp and the 
offset time (column 7, lines 41-45). 

Regarding claim 4, the method wherein: 

the speech data (comment) comprises a time stamp (column 42, lines 20-30); 
the step of determining a first time comprises: 

determining an offset time (start and ending offset) between the time stamp and 
a time when the first voice command (audio presentation) is active (column 4, lines 48- 
57); and 

determining the first time through reference to the time stamp and the offset 
time (column 7, lines 41-45); and 

the step of determining a second time comprises: 

determining a second offset (start and ending offset) time between the time 
stamp and a time when the second voice command is active (column 4, lines 48-57); 
and 
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determining (identify) the second time through reference to the time stamp and 
the second offset time (column 7, lines 41-45). 

Regarding claim 5, Logan discloses the method wherein: 

the step of determining the first time through reference to the time stamp and the 
offset time comprises the step of adding the offset time to the time stamp to determine 
the first time (inherent in segment to be calculated; column 1 1 , lines 36-51 ); and 

the step of determining the second time through reference to the time stamp and 
the second offset time comprises the step of adding the second offset time to the 
time stamp to determine the second time (inherent in segment to be calculated; column 
lljines 36-51). 

Regarding claim 6, Logan discloses the method wherein: 

the speech data (comments and annotations) comprises first and second time 
stamps (column 42, lines 20-30); 

the step of determining a first time comprises: 

determining a first offset time (start and ending offset) between the first 
time stamp and a fime when the first voice command (audio presentation) is active 
(column 4, lines 48-57); and 

determining (idenfify) the first fime through reference to the first time 
stamp and the first offset time (column 7, lines 41-45); and 
the step of determining a second fime comprises: 
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determining a second offset (start and ending offset) time between tine 
second time stamp and a time when the second voice command (audio presentation) 
is active (column 4, lines 48-57); and 

determining (identify) the second time through reference to the second 
time stamp and the second offset time (column 7, lines 41-45). 

Regarding claims 7 and 18, Logan discloses the method further comprising the 
steps of: 

recording speech onto a portable recorder (portable computer or player; column 
6, lines 35-37); and 

loading the speech data from the portable recorder (player) to the computer 
system (CPU), the speech data comprising the speech and a plurality of time stamps 
(time of day clock; column 3, lines 1-18). 

Regarding claim 8, Logan discloses a method and an article of manufacture, 
performed on a computer system for tracking time (log time; column 1 1 , lines 41-51 ) 
using speech recognition, but lacks the method further comprising the steps of: 
determining at least one task name from the text of the at least two voice 
commands. 

Ladd discloses determining at least one task name (tags, symbols, etc.) from the 
text of the at least two voice commands (figure 6 with column 16, lines 17-25 and 34- 
45), to control interactive voice service. 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to modify Logan's invention such that it converts each of at 
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least two voice commands to text and determines text versions of the at least two voice 
commands by comparing words in the text with phrase grammar rules, to provide 
pronunciation of words and phrases, and place markers in the text, to control interactive 
voice service (column 16, lines 17-20). 

Regarding claim 10, Logan discloses the method wherein the step of 
determining at least one task name comprises finding the at least one task name (words 
and phrases in the presentation) in the text (column 37, lines 36-45). 

Regarding claim 11, Logan discloses the method wherein the step of 
determining at least one task name comprises associating at least one task name to 
(filename) a time period (time established) between the first and second times, wherein 
the at least one task name is not in the text (user preference; column 6, line 60 - 
column 7, line 7). 

Regarding claims 12 and 16, Logan discloses the method wherein the at least 
one task name comprises two task names, a first task name ("Five") associated with a 
first of the voice commands and a second task name ("News") associated with a 
second of the voice commands, wherein the first and second voice commands occur 
adjacent to each other in time (while), wherein the first and second task name are 
different ("Five" and "News"), and wherein the second voice command is assumed to 
end a first task corresponding to the first task name and start a second task 
corresponding to the second task name (start of new segment; column 12, line 55 - 
column 13, line 2). 
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Regarding claim 13, Logan discloses the method further comprising the step of 
packaging the first time, second time (total time), and one task name (plays field) from 
the at least one task name into a time increment (column 19, line 63 - column 20, line 
7). 

Regarding claim 14, Logan discloses the method wherein the at least two voice 
commands comprises a plurality of voice commands, wherein the at least one task 
name comprises a plurality of task names, and wherein the method further comprises 
the steps of: 

determining an additional plurality of voice command times ("Back", "Back 
Segment" etc.), each of the voice command times associated with one of the plurality 
of additional voice command times ("Skip", "Mark etc.; column 14, lines 11-51); 

converting each of the plurality of voice commands to text (column 15, lines 

4-6); 

determining a plurality of task names (text file) from the text (column 37, lines 
36-45); 

associating a task name with two of the first time, second time, or additional 
plurality of voice command times ("Skip", "Mark etc.; column 14, lines 1 1-51 ); 

creating a plurality of time increments (different program segment; column 12, 
lines 55-56), each time increment comprising two times of the first time, second time, 
or additional plurality of voice command times (receipt of command; column 12, lines 
1 7-23) and a task name (log file; column 1 1 , lines 44-46); and 
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storing the plurality of time increments (host sever; figure 1, element 101 and 
column 4, lines 15-27). 

Regarding claims 15 and 28, Logan discloses a method and article of 
manufacture, performed on a computer system (column 1, lines 48-52), for tracking 
time (log time; column 11, lines 41-51) using speech recognition (column 35, line 34), 
the method comprising the steps of: 

accessing speech data (audio speech file; column 2, line 57) comprising a 
plurality of time stamps and speech (column 42, lines 20-30); 

determining a time associated with each of the voice commands (column 42, 
lines 20-30); 

determining a plurality of tasks, each task associated with at least one of the 
times and at least one of the voice commands ("Skip", "Mark etc.; column 14, lines 11- 
51); and 

determining a plurality of time increments (different program segment; column 
12, lines 55-56), each time increment comprising one of the tasks (log file; column 1 1 , 
lines 44-46) and at least one of the times (receipt command; column 12, lines 17-23), 
but lacks converting the speech to text and composing a plurality of voice commands 
from words in the text, each voice command corresponding to a phrase grammar rule. 

Ladd discloses a method comprising: 

converting the speech to text (column 4, lines 47-49); and 
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composing a plurality of voice commands from words in the text (column 4, lines 
47-49), each voice command corresponding to a phrase grammar rule (phonetic rules; 
column 14, lines 25-45), to control interactive voice service. 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to modify Logan^s invention such that it converts each of at 
least two voice commands to text and determines text versions of the at least two voice 
commands by comparing words in the text with phrase grammar rules, to provide 
pronunciation of words and phrases, and place markers in the text, to control interactive 
voice service (column 16, lines 17-20). 

Regarding claim 19, Logan a system for tracking time using speech recognition, 
the system comprising: 

a computer system comprising: 

a memory (mass storage device) that stores computer-readable code (records 
programming; column 3, lines 1-18); and 

a processor operatively coupled to the memory, the processor configured to 
implement the computer-readable code (column 3, lines 1-9 and column 19, lines 11- 
17), the computer-readable code configured to: 

access speech data (audio speech file; column 2, line 57); 

recognize at least two voice commands ("Go", "Five", "News", etc.) from the 
speech data, each voice command occurring at a different time (shift to different 
segment; column 12, lines 55-60); 

determine a first time (start time) associated with a first of the voice commands 
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(advertising segment; column 11, lines 41-51); and 

determine a second time (end time) associated with a second of the voice 
commands (advertising segment; column 11, lines 41-51), but lacks converting each of 
at least two voice commands to text and determining text versions of the at least two 
voice commands by comparing words in the text with phrase grammar rules. 

Ladd discloses converting each of at least two voice commands (column 4, lines 
9-10 and column 5, lines 9-10) to text (column 4, lines 47-49); and 

determining text versions of the at least two voice commands (column 4, lines 
47-49) by comparing words in the text (VRU server compares input) with phrase 
grammar rules (phonetic rules; column 14, lines 25-45), to control interactive voice 
service. 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to modify Logan's invention such that it converts each of at 
least two voice commands to text and determines text versions of the at least two voice 
commands by comparing words in the text with phrase grammar rules, to provide 
pronunciation of words and phrases, and place markers in the text, to control interactive 
voice service (column 16, lines 17-20). 

Regarding claim 20, Logan discloses the system wherein the speech data 
comphses a time stamp, and wherein the computer-readable code is further configured 
to: 

when determining a first time: 
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determining an offset time (start and ending offset) between tine time 
stamp and a time when the first voice command (audio presentation) is active (column 
4, lines 48-57); and 

determining (identify) the first time through reference to the time stamp 
and the offset time (column 7, lines 41-45); and 
when determining a second time comprises: 

determining a second offset time (start and ending offset) between the 
time stamp and a time when the second voice command is active (column 4, lines 48- 
57); and 

determining (identify) the second time through reference to the time stamp 
and the second offset time (column 7, lines 41-45). 

Regarding claim 21, Logan discloses the system wherein the computer-readable 
code is further configured to: 

store the time increments (host sever; figure 1, element 101 and column 4, lines 
15-27); and 

place the time increments into a file (form of file (sequence)) having a format 
suitable for importing into a time and billing program (column 15, lines 35-66). 

Regarding claim 22, Logan discloses the system wherein the system further 
comprises a digital personal recorder and wherein the computer-readable code is 
further configured to receive the speech data from the digital personal recorder 
(column 3, lines 1-18). 
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Regarding claim 24, Logan discloses a system for tracking time speech 
recognition, the system comprising: 
a computer system comprising: 

a memory (mass storage device) that stores computer-readable code (records 
programming; column 3, lines 1-18); and 

a processor operatively coupled to the memory, the processor configured to 
implement the computer-readable code (column 3, lines 1-18 and column 19, lines 11- 
17), the computer-readable code configured to: 

access speech data (audio speech file; column 2, line 57) comprising a plurality 
of time stamps and speech (column 42, lines 20-30); 

determine a time (start and ending offset) associated with each of the voice 
commands (audio presentation; column 4, lines 48-57); 

determine a plurality of tasks, each task associated with at least one of the times 
and at least one of the voice commands ("Skip", "Mark etc.; column 14, lines 11-51); 
and 

determine a plurality of time increments (different program segment), each time 
increment comprising one of the tasks (log file) and at least one of the times (segment; 
column 12, lines 55 - column 13, line 2), but lacks converting each of at least two voice 
commands to text and determining text versions of the at least two voice commands by 
comparing words in the text with phrase grammar rules. 

Ladd discloses converting each of at least two voice commands (column 4, lines 
9-10 and column 5, lines 9-10) to text (column 4, lines 47-49); and 
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determining text versions of the at least two voice commands (column 4, lines 
47-49) by comparing words in the text (VRU server compares input) with phrase 
grammar rules (phonetic rules; column 14, lines 25-45), to control interactive voice 
service. 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to modify Logan's invention such that it converts each of at 
least two voice commands to text and determines text versions of the at least two voice 
commands by comparing words in the text with phrase grammar rules, to provide 
pronunciation of words and phrases, and place markers in the text, to control interactive 
voice service (column 16, lines 17-20). 

Regarding claim 25, Logan discloses the system wherein the computer-readable 
code is further configured to: 

store the time increments (host sever; figure 1, element 101 and column 4, lines 
15-27); and 

place the time increments into a file (form of file (sequence)) having a format 
suitable for importing into a time and billing program (column 15, lines 35-66). 

Regarding claim 26, Logan discloses the system wherein the system further 
comprises a digital personal recorder and wherein the computer-readable code is 
further configured to receive the speech data from the digital personal recorder 
(column 3, lines 1-18). 
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Conclusion 



5. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Jakieda R Jackson whose telephone number is 
703.305.5593. The examiner can normally be reached on Monday through Friday from 
7:30 a.m. to 5:00p.m. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Talivaldis I. Smits can be reached on 703. 306-301 1 . The fax phone 
number for the organization where this application or proceeding is assigned is 
703.872.9314. 

Any inquiry of a general nature or relating to the status of this application or 
proceeding should be directed to the receptionist whose telephone number is 
703.305.4700. 



JRJ 

May 18, 2004 
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